Analyzing 4 Million Real-World Personal Knowledge Questions (Short Paper)
نویسندگان
چکیده
Personal Knowledge Questions are widely used for fallback authentication, i. e., recovering access to an account when the primary authenticator is lost. It is well known that the answers only have lowentropy and are sometimes derivable from public data sources, but easeof-use and supposedly good memorability seem to outweigh this drawback for some applications. Recently, a database dump of an online dating website was leaked, including 3.9 million plain text answers to personal knowledge questions, making it the largest publicly available list. We analyzed this list of answers and were able to confirm previous findings that were obtained on non-public lists (WWW 2015), in particular we found that some users don’t answer truthfully, which may actually reduce the answer’s entropy.
منابع مشابه
What’s in a Name? Evaluating Statistical Attacks on Personal Knowledge Questions
We study the efficiency of statistical attacks on human authentication systems relying on personal knowledge questions. We adapt techniques from guessing theory to measure security against a trawling attacker attempting to compromise a large number of strangers’ accounts. We then examine a diverse corpus of real-world statistical distributions for likely answer categories such as the names of p...
متن کاملGetting There First: Real-Time Detection of Real-World Incidents on Twitter
Social networking and micro-blogging services such as Twitter have become a valuable source of information on current events. Widespread use of Twitter on mobile devices and personal computers enables users to share short messages on any subject in real-time, thus making it suitable for early detection of unexpected events where fast response is critical. In this paper, we present an online met...
متن کاملA practical approach for content mining of Tweets.
Use of data generated through social media for health studies is gradually increasing. Twitter is a short-text message system developed 6 years ago, now with more than 100 million users generating over 300 million Tweets every day. Twitter may be used to gain real-world insights to promote healthy behaviors. The purposes of this paper are to describe a practical approach to analyzing Tweet cont...
متن کاملEvaluation of PICO as a Knowledge Representation for Clinical Questions
The paradigm of evidence-based medicine (EBM) recommends that physicians formulate clinical questions in terms of the problem/population, intervention, comparison, and outcome. Together, these elements comprise a PICO frame. Although this framework was developed to facilitate the formulation of clinical queries, the ability of PICO structures to represent physicians' information needs has not b...
متن کاملKnowledge Verification for LongTail Verticals
Collecting structured knowledge for real-world entities has become a critical task for many applications. A big gap between the knowledge in existing knowledge repositories and the knowledge in the real world is the knowledge on tail verticals (i.e., less popular domains). Such knowledge, though not necessarily globally popular, can be personal hobbies to many people and thus collectively impac...
متن کامل